AITopics | difference approximation

Collaborating Authors

difference approximation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

Kim, Eunsu, Suk, Juyoung, Kim, Seungone, Muennighoff, Niklas, Kim, Dongkwan, Oh, Alice

arXiv.org Artificial IntelligenceDec-30-2024

We introduce LLM-as-an-Interviewer, a novel paradigm for evaluating large language models (LLMs). This approach leverages multi-turn interactions where the LLM interviewer actively provides feedback on responses and poses follow-up questions to the evaluated LLM. At the start of the interview, the LLM interviewer dynamically modifies datasets to generate initial questions, mitigating data contamination. We apply the LLM-as-an-Interviewer framework to evaluate six models on the MATH and DepthQA tasks. Our results show that the framework effectively provides insights into LLM performance, including the quality of initial responses, adaptability to feedback, and ability to address follow-up queries like clarification or additional knowledge requests. The framework also addresses key limitations of conventional methods like LLM-as-a-Judge, including verbosity bias and inconsistency across runs. Finally, we propose the Interview Report, which aggregates insights from the interview process, providing examples and a comprehensive analysis of the LLM's strengths and weaknesses. This report offers a detailed snapshot of the model's real-world applicability. The code for our framework is publicly available at https://github.com/interview-eval/.

approximation, difference approximation, follow-up question, (16 more...)

arXiv.org Artificial Intelligence

2412.10424

Country:

Asia > Thailand > Bangkok > Bangkok (0.04)
Asia > Singapore (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
(7 more...)

Genre:

Research Report > New Finding (1.00)
Personal > Interview (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Add feedback

Data Mapping for Restricted Boltzmann Machine

You, Jiangsheng

arXiv.org Machine LearningSep-25-2019

R estricted Boltzmann machine (RBM) is two - layer neural nets constructed as a probabilistic model and i t s training is to maximiz e a product of probabilities by the contrastive divergence (CD) scheme . In this paper a data mapping is used to describe the relationship between visible and hidden layer s and the training is to minimize a squared error of the reconstructed visible layer by the gradient descent or a finite difference approximation . T his paper presents three new findings: 1) nodes on visible and hidden layers can take real - valued matrix dat a without a probabilistic interpretation; 2) the famous CD1 is a finite difference approximation of gradient descent after ignoring the second - order error; 3) activation can take non - sigmoid function s such as identity, relu and softsign. The data mapping p rovides a unified framework on dimensionality reduction, feature extraction and data representation pioneered and developed by Hinton and his colleagues . As an approximation of gradient descent, the finite difference learning is applicable to both directed and undirected graphs. N umerical results are performed to confirm these new findings on very low dimensionality reduction, matrix data and flexible activation s . Keywords: Restricted Boltzmann machine, data mapping, squared error, contrastive divergence, gradient descent and finite difference .

activation, mapping, rbm, (13 more...)

arXiv.org Machine Learning

1909.0821

Genre: Research Report (0.90)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.91)

Add feedback